NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Federation Strikes Back: A Survey of Federated Learning Privacy Attacks, Defenses, Applications, and Policy Landscape

https://doi.org/10.1145/3724113

Zhao, Joshua; Bagchi, Saurabh; Avestimehr, Salman; Chan, Kevin; Chaterji, Somali; Dimitriadis, Dimitris; Li, Jiacheng; Li, Ninghui; Nourian, Arash; Roth, Holger (September 2025, ACM Computing Surveys)

Deep learning has shown incredible potential across a wide array of tasks, and accompanied by this growth has been an insatiable appetite for data. However, a large amount of data needed for enabling deep learning is stored on personal devices, and recent concerns on privacy have further highlighted challenges for accessing such data. As a result, federated learning (FL) has emerged as an important privacy-preserving technology that enables collaborative training of machine learning models without the need to send the raw, potentially sensitive, data to a central server. However, the fundamental premise that sending model updates to a server is privacy-preserving only holds if the updates cannot be “reverse engineered” to infer information about the private training data. It has been shown under a wide variety of settings that this privacy premise doesnothold. In this article we provide a comprehensive literature review of the different privacy attacks and defense methods in FL. We identify the current limitations of these attacks and highlight the settings in which the privacy of an FL client can be broken. We further dissect some of the successful industry applications of FL and draw lessons for future successful adoption. We survey the emerging landscape of privacy regulation for FL and conclude with future directions for taking FL toward the cherished goal of generating accurate models while preserving the privacy of the data from its participants.
more » « less
Free, publicly-accessible full text available September 30, 2026
Leak and Learn: An Attacker's Cookbook to Train Using Leaked Data from Federated Learning

https://doi.org/10.1109/CVPR52733.2024.01164

Zhao, Joshua C; Dabholkar, Ahaan; Sharma, Atul; Bagchi, Saurabh (June 2024, IEEE/CVF CVPR)

Full Text Available
Loki: Large-scale Data Reconstruction Attack against Federated Learning through Model Manipulation

https://doi.org/10.1109/SP54263.2024.00030

Zhao, Joshua C; Sharma, Atul; Elkordy, Ahmed Roushdy; Ezzeldin, Yahya H; Avestimehr, Salman; Bagchi, Saurabh (May 2024, IEEE Security and Privacy)

Full Text Available
FLAIR: Defense against Model Poisoning Attack in Federated Learning

Sharma, Atul; Chen, Wei; Zhao, Joshua; Qiu, Qiang; Bagchi, Saurabh; Chaterji, Somali. (July 2023, ACM ASIA CCS)

Federated learning—multi-party, distributed learning in a decentralized environment—is vulnerable to model poisoning attacks, more so than centralized learning. This is because malicious clients can collude and send in carefully tailored model updates to make the global model inaccurate. This motivated the development of Byzantine-resilient federated learning algorithms, such as Krum, Bulyan, FABA, and FoolsGold. However, a recently developed untargeted model poisoning attack showed that all prior defenses can be bypassed. The attack uses the intuition that simply by changing the sign of the gradient updates that the optimizer is computing, for a set of malicious clients, a model can be diverted from the optima to increase the test error rate. In this work, we develop FLAIR—a defense against this directed deviation attack (DDA), a state-of-the-art model poisoning attack. FLAIR is based on ourintuition that in federated learning, certain patterns of gradient flips are indicative of an attack. This intuition is remarkably stable across different learning algorithms, models, and datasets. FLAIR assigns reputation scores to the participating clients based on their behavior during the training phase and then takes a weighted contribution of the clients. We show that where the existing defense baselines of FABA [IJCAI’19], FoolsGold [Usenix ’20], and FLTrust [NDSS ’21] fail when 20-30% of the clients are malicious, FLAIR provides byzantine-robustness upto a malicious client percentage of 45%. We also show that FLAIR provides robustness against even a white-box version of DDA.
more » « less
Full Text Available
How to learn collaboratively - Federated learning to peer-to-peer learning and what's at stake.

Sharma, Atul; Zhao, Joshua; Chen, Wei; Qiu, Qiang; Bagchi, Saurabh; Chaterji, Somali (June 2023, 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), “Disrupt 23: Disruptive Ideas and New Interdisciplinary Results” Track)

Standard ML relies on training using a centrally collected dataset, while collaborative learning techniques such as Federated Learning (FL) enable data to remain decentralized at client locations. In FL, a central server coordinates the training process, reducing computation and communication expenses for clients. However, this centralization can lead to server congestion and heightened risk of malicious activity or data privacy breaches. In contrast, Peer-to-Peer Learning (P2PL) is a fully decentralized system where nodes manage both local training and aggregation tasks. While P2PL promotes privacy by eliminating the need to trust a single node, it also results in increased computation and communication costs, along with potential difficulties in achieving consensus among nodes. To address the limitations of both FL and P2PL, we propose a hybrid approach called Hubs-and-Spokes Learning (HSL). In HSL, hubs function similarly to FL servers, maintaining consensus but exerting less control over spokes. This paper argues that HSL’s design allows for greater availability and privacy than FL, while reducing computation and communication costs compared to P2PL. Additionally, HSL maintains consensus and integrity in the learning process.
more » « less
Full Text Available
How to Learn Collaboratively - Federated Learning to Peer-to-Peer Learning and What’s at Stake

https://doi.org/10.1109/DSN-S58398.2023.00036

Sharma, Atul; Zhao, Joshua C; Chen, Wei; Qiu, Qiang; Bagchi, Saurabh; Chaterji, Somali (June 2023, IEEE/IFIP DSN)

Full Text Available

Search for: All records